WO 2004/005498 



nTn 10/520153 

DT05 Rec'd PCT/PKI 0-3 JMMK 

PCT/US2003/021002^ CUUD 



SEQUENCE LISTING 



<110> MARTIN, ANNETTE 
5 SANGAR, DAVID V. 

LEMON, STANLEY M. 
RIJNBRAND, RENE 

10 <12p> Chimeric GB Virus B (GBV-B) 

<130> UTFG:258WO 

<140>. UNKNOWN 
15 <141> 2003-07-02 

<150> 10/189.359 
<151> 2002-07-03 

20 <160> 16 



25 



30 



35 



<170> Patentln Ver. 2.1 

<210> 1 
<211> 260 
<212> RNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: 



GB VIRUS B 



<400> 1 

gaguuugcga ccauggugga ucagaaccgu uucgggugaa gccauggucu gaaggggaug 60 
acgucccuuc uggcucaucc acaaaaaccg ucucgggugg gugaggaguc cuggcugugu 12 0 
gggaagcagu caguauaauu cccgucgugu guggugacgc cucacgacgu auuuguccgc 180 
ugugcagagc gaaguaccaa gggcugcacc ccgguuuuug uuccaagcgg agggcaaccc 240 

260 



ccgcuuggaa uuaaaaacug 



40 



45 



50 



55 



60 



65 



<210> 2 
<211> 9399 
<212> DNA 

<213> GBV-A-like virus 
<400> 2 

accacaaaca ctccagtttg ttacactccg ctaggaatgc tcctggagca ccccccctag 60 
cagggcgtgg gggatttccc ctgcccgtct gcagaagggt ggagccaacc accttagtat 120 
gtaggcggcg ggactcatga cgctcgcgtg atgacaagcg ccaagcttga cttggatggc 18 0 
cctgatgggc gttcatgggt tcggtggtgg tggcgcttta ggcagcctcc acgcccacca 24 0 
cctcccagat agagcggcgg cactgtaggg aagaccgggg accggtcact accaaggacg 3 00 
cagacctctt tttgagtatc acgcctccgg aagtagttgg gcaagcccac ctatatgtgt 3 60 
tgggatggtt ggggttagcc atccataccg tactgcctga tagggtcctt gcgaggggat 420 
ctgggagtct cgtagaccgt agcacatgcc tgttatttct actcaaacaa gtcctgtacc 480 
tgcgcccaga acgcgcaaga acaagcagac gcaggcttca tatcctgtgt ccattaaaac 540 
atctgttgaa aggggacaac gagcaaagcg caaagtccag cgcgatgctc ggcctcgtaa 600 
ttacaaaatt gctggtatcc atgatggctt gcagacattg gctcaggctg ctttgccagc 660 
tcatggttgg ggacgccaag accctcgcca taagtctcgc aatcttggaa tccttctgga 720 
ttaccctttg gggtggattg gtgatgttac aactcacaca cctctagtag gcccgctggt 780 
ggcaggagcg gtcgttcgac cagtctgcca gatagtacgc ttgctggagg atggagtcaa 840 
ctgggctact ggttggttcg gtgtccacct ttttgtggta tgtctgctat ctttggcctg 900 
tccctgtagt ggggcgcggg tcactgaccc agacacaaat accacaatcc tgaccaattg 960 
ctgccagcgt aatcaggtta tctattgttc tccttccact tgcctacacg agcctggttg 1020 
tgtgatctgt gcggacgagt gctgggttcc cgccaatccg tacatctcac acccttccaa 1080 
ttggactggc acggactcct tcttggctga ccacattgat tttgttatgg gcgctcttgt 1140 
gacctgtgac gcccttgaca ttggtgagtt gtgtggtgcg tgtgtattag tcggtgactg 1200 
gcttgtcagg cactggctta ttcacataga cctcaatgaa actggtactt gttacctgga 1260 
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agtgcccact ggaatagatc ctgggttcct 
cgaggctgtc atcttcttga ccaaactggc 
gtttagcagt gtacactacc tggcggttgg 
gtggtatcag ttgctcctag cgcttatgct 
5 cagggtgccc actggatgct caatagctga 
ttgccactct tatttgagtg agaatgtgtc 
caggcctatc actctagagt ataacaactc 
tgcgagggga tgtatggtta aattcaaaaa 
tgtgccatcg tactgcacta tgggcactga 

10 cgaagtatgc ggtgtaacac catggctaac 
attggctata ttacaatacc ctgggtctaa 
aggccatttg tattttgagg gatcagatac 
ttccactctc ctaccaccgg agaggtgggc 
tggttcttgg ttacaggttc cgcaagggtt 

15 attgatcacc aaagacaaag cctggaaaaa 
tttgtctctt acgggagtta ccaccaaggc 
cagcaagtat cttattttag cctacctctg 
tggttaccct ttgcgtcctg tgctcccatc 
tttgtctaaa gctcaagtag ctccttttgc 

20 ctgcaggcta cgttatgctg cccttttagg 
aactttcttt gttgcagcag ctgctgccca 
agtggcaggg ttagttttgt gggccggccg 
aggtccttgg cctctggtag cgcttttaac 
ttttgatacc gagataattg gagggctgac 

25 gtctcgtttt ggcttctttg cfccacttgtt 
ttggcaacgt tgggagaatt ggttttggaa 
tgtgctggtt tgtttccccg gtgcgacata 
cgtagctctt ctatgtttaa catccagtgc 
tagggcccat agaatgttgg tgcgtctcgg 

30 tcttaagttt ttcctcttag tgtttggtga 
tggt^gatgtc ttgcctaatg attttgcctc 
ttttgaaggc aaggcaaggg tctataggaa 
ggttgatggt ttgcccgttg ttgcgcgtct 
gccgccagat gggtgggcca ttaccgcacc 

35 cacgctgtca gcgatggcag tggtcatgac 
tatcttcaga ttaggatctc tggccactag 
gtatactgct caccatggca gcaaggggcg 
cccaataacc gttgacgcgg ctaatgacca 
gtcccttact cggtgctctt gcggggagac 

40 attggttgag gtcaacaaat ccgatgaccc 
ggctgttgcc aagggttctt caggtgcccc 
gatgttcacc gctgctagaa attctggcgg 
ggtgtgtgct ggataccatc cccagtacac 
tgtgcctaac gagtattcag tgcaaatttt 

45 caaattacca ctttcttaca tgcaggagaa 
ggctacaaca gcatcaatgc caaagtacat 
ctattttaat ggcaaatgta ccaacacagg 
gtacctgacc ggagcatgtt cccggaacta 
taccgatgca accaccgtgt tgggcattgg 

50 tgttaggcta gtggttcttg ccacggctac 
caacataact gagattcaat taaccgatga 
taaggaggaa aatctgaaga aagggagaca 
tgatgagctt gctaacgagt tagctcgaaa 
atgtgacatc tcaaaaatcc ctgagggcga 

55 tacagggtac actggtgact ttgattccgt 
atgccatgtt gaccttgacc ctactttcac 
aatagttaaa ggccagcgta ggggccgcac 
tgtagacggg agttgtaccc cttcgggtat 
cgacgcagcc aaggcatggt atggtttgtc 

60 ctatcgcacc caacctgggt tacctgcgat 
cttttctatg gtcaaccccg aaccttcatt 
ttatgttttg ttgactgcag cccaactaca 
caatgacgca ccacggtggc agggagcccg 
gcgcttggac ggcgctgacg cctgtcctgg 

65 aatgtgcttc actgaagtca atacttctgg 
ggctatggct tatctagcca ttgacacttt 



agggtttatc gggtggatgg ccggcaaggt 132 0 
ttcacaagta ccatacgcta ttgcgactat 1380 
cgctctgatc tactatgcct ctcggggcaa 1440 
ttacatagaa gcgacctctg gaaaccctat 1500 
gttttgctcg cctttgatga taccatgtcc 1560 
agaagtcatt tgttacagtc caaagtggac 1620 
catatcttgg tacccctata caatccctgg 1680 
taacacatgg ggttgctgcc gtattcgcaa 1740 
tgcagtgtgg aacgacactc gcaacactta 1800 
aaccgcatgg cacaacggct cagccctgaa 1860 
agaaatgttt aaacctcata attggatgtc 1920 
ccctatagtt tacttttatg accctgtgaa 1980 
taggttgccc ggtaccccac ctgtggtacg 2040 
ttacagtgat gtgaaagacc tagccacagg 2100 
ttatcaggtc ttatattccg ccacgggtgc 2160 
cgtggtgcta attctgttgg ggttgtgtgg 2220 
ttacttgtcc ctttgttttg ggcgcgcttc 2280 
ccagtcgtat ctccaagctg gctgggatgt 2340 
tttgattttc ttcatctgtt gctatctccg 2400 
gtttgtgccc atggctgcgg gcttgcccct 24 60 
accagattat gactggtggg tgcgactgct 2520 
taaccgtggt caccgcatag ctctgcttgt 2580 
cctcttgcat ttggttacgc ctgcttcagc 2640 
aataccacct gtagtagcat tagttgtcat 2700 
acctcgctgt gctttagtta actcctatct 2760 
cgttacacta agaccggaga ggtttttcct 2820 
tgacgcgctg gtgactttct gtgtgtgtca 2 880 
agcatcgttc tttgggactg actctagggt 2940 
aaagtgtcat gcttggtatt ctcattatgt 3000 
gaatggtgtg tttttctata agcacttgca 3060 
gaaactacca ttgcaagagc catttttccc 3120 
tgaaggaaga cgcttggcgt gtggggacac 3180 
cggcgacctt gttttcgcag ggttggctat 3240 
ttttacgctg cagtgtctct ctgaacgtgg 3300 
tggtatagac ccccgaactt ggactggaac 3360 
ctacatggga tttgtttgtg acaacgtgtt 342 0 
ccggttggct catcccacag gctctataca 3480 
ggacatctat caaccaccat gtggagctgg 3540 
caaggggtat ctggtaacac gactggggtc 3600 
ttattggtgt gtgtgcgggg cccttcccat 3660 
gattctgtgc tcctccgggc atgfctattgg 3720 
ttcagtcagt cagattaggg ttaggccgtt 3780 
agcacatgcc actcttgata caaaacctac 384 0 
aattgccccc actggcagcg gcaagtcaac 3 900 
gtatgaggtc ttggtcctaa atcccagtgt 3960 
gcacgcgacg tacggcgtga atccaaattg 4020 
ggcttcactt acgtacagca catatggcat 40 80 
tgatgtaatc atttgtgacg aatgccatgc 414 0 
aaaggtccta accgaagctc catccaaaaa 4200 
cccccctgga gtaatcccta caccacatgc 4260 
aggcactatc ccctttcatg gaaaaaagat 4320 
ccttatcttt gaggctacca aaaaacactg 43 80 
gggaataaca gctgtctctt actatagggg 4440 
ctgtgtagta gttgccactg atgccttgtg 4500 
gtatgactgc agcctcatgg tagaaggcac 4560 
catgggtgtt cgtgtgtgcg gggtttcagc 4620 
aggccgtggg agagctggca tatactacta 4680 
ggttcctgaa tgcaacattg ttgaagcctt 4740 
atcaacagaa gctcaaacta ttctggacac 4 800 
aggagcaaat ttggacgagt gggctgatct 4860 
tgtcaatact gcaaaaagaa ctgctgacaa 4920 
actgtgtcat cagtatggct atgctgctcc 4980 
gcttgggaaa aaaccttgtg gggttctgtg 5040 
cccagagccc agcgaggtga ccagatacca 5100 
gacagccgca ctcgctgttg gcgttggagt 5160 
tggcgccact tgtgtgcggc gttgctggtc 5220 
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tattgcatca gtccctaccg gtgctactgt 
99 a ggagtgt gcatcattca ttcccttgga 
gagtacaatc accacaacta gtcctttcac 
ctttcttggg cctcatgcag ctacaatcct 
5 cactttacct gacaatccct ttgcatcatg 
cccactacct cacaagatca aaatgttcct 
gcttacagac gctagaggcg cactggcgtt 
tggtacatgg acatcggtgg gttttgtctt 
atccactgct tgcttgacat ttaaatgctt 

10 tgctggttta gtctactccg cgttcaatcc 
ttgtgcaatg tttgctttga caacagcagg 
tatgcttgct aggagcaaca ctgtatgtaa 
caggaagata ctgggcattc tggaggcatc 
ccgttggctc cacaccccga cggaggatga 

15 ttggcagtat gtgtgcaatt tctttgtgat 
gagcatggtt aacattcctg gttgtccttt 
ctggattgga tcaggtatgc tccaagcacg 
tgttgagaat ggttttgcaa aactttacaa 
aggggctgtt ccagtcaacg ctaggctgtg 

20 gactagtctt gtcgtcaatt atggcgttag 
tcacattttt gttacagcag tatcctctcc 
cttgagagct gcagtggccg tggacggcgt 
aactccttgg acgacatctg cttgctgtta 
gcttcccttc cgcgttgacg gtcacacacc 

25 tgcacttgag acaaatgact gtaattccac 
gtccgctctt gttttcaaac aggagttgcg 
a gctggcgtt gacaccacca aactgccagc 
gcgccagttc cgggcaagaa ctggttcgct 
aggagtgtca tgtcctgaaa gcctgcaacg 

30 ccctccttca ccacctgttc tacagttggc 
gtgt^aaccct ttcactgcaa ttggatgtgc 
tttacccagt taccctccca aaaaggaggt 
ggctacaacc gtttccagct acgttactgg 
ttccactcag tcagcccccg ccaaacggcc 

35 ttcgtgcagc atgagctaca cctggaccga 
tctgtctgca actcgggcca tcactagtgg 
gactgagccg cgggatgcgg agcttagaaa 
gttccGccca tcataccaca agcaagtgag 
cggtgtcatg tgggactatg atgaagtagc 

40 ccacatcact ggccttcggg gcactgatgt 
ggacttgcag aagtgtgtcg aggcaggtga 
agttccaaag gaggaggtct tcgtgaagac 
gcttatctcg tacccccacc ttgaaatgag 
tgctcctgac gtagttaaag ctgtcatggg 

45 ccgtgtcaag cgtctgttgt cgatgtggtc 
agtgtgtttt gacagtacca tcacacccga 
agcagctaaa ctcagtgacc aacaccgagc 
cgctggagga ccgatgatcg cttatgatgg 
ttccggcgtc tatactacct caagttccaa 

50 tgcagccgaa caggctggca tgaagaaccc 
cgtaatttgg aagagcgccg gagcagatgc 
ctggatgaag gtgatgggtg caccacaaga 
agaattaaca tcatgctcat caaatgttac 
ctactttctt acaagagatc ctcgtatccc 

55 atacaacccc agtgctgcgt ggattgggta 
tagccgtgtg ttggctgtcc atttcatgga 
gacggtgacc tttgactggt atgggaaaaa 
catcattgct ggtgtgcacg gtattgaggc 
gatcctcaga gtttcccaat cactaacaga 

60 aaagaaagcc agggcggtcc tcgccagcgc 
ggctcgcttc cttctctggc atgctacatc 
cgtggctcgg tacaccactt tcaattattg 
tattacacca cagagaagat tgcagaagtt 
tgccctaggg ctcattgctg ttggattagc 

65 ctaacagttt tttttttttt tttttttttt 
ttaacgaccc cgccgatgtg agtttggcga 



cgccccagtg gttgacgaag aagaaatcgt 5280 
ggccatggtt gctgcaatcg ataagctgaa 5340 
attggaaacc gcccttgaaa aacttaacac 5400 
tgctatcata gagtattgct gtggtttagt 5460 
cgtgtttgct ttcattgcgg gtattactac 5520 
gtcattattt ggaggcgcaa ttgcgtccaa 5580 
catgatggcc ggggctgcgg gaacagctct 5640 
tgacatgcta ggcggctatg ctgccgcctc 5700 
9 a tgggtgag tggcccacta tggatcagct 5760 
ggccgcagga gttgtgggcg tcttgtcagc 5820 
gccagatcac tggcccaaca gacttcttac 5880 
tgagtacttt attgccactc gtgacatccg 5940 
taccccctgg agtgtcatat cagcttgcat 6000 
ttgcggcctc attgcttggg gtctagagat 6060 
ttgctttaat gtccttaaag ctggagttca 6120 
ctacagctgc cagaaggggt acaagggccc 6180 
ctgtccatgc ggtgctgaac tcatcttttc 6240 
aggacccaga acttgttcaa attactggag 6300 
tgggtcggct agaccggacc caactgattg 6360 
ggactactgt aaatatgaga aaatgggaga 6420 
aaatgtctgt ttcacccagg tgcccccaac 6480 
acaggttcag tgttatctag gtgagcccaa 6540 
cggtcctgac ggtaagggta aaactgttaa 6600 
tggtgtgcgc atgcaactta atttgcgtga 6660 
aaacaacact cctagtgatg aagccgcagt 6720 
gcgtacaaac caattgcttg aggcaatttc 6780 
cccctccatc gaagaggtag tggtaagaaa 684 0 
taccttgcct ccccctccga gatccgtccc 6900 
aagtgacccg ttagaaggtc cttcaaacct 6960 
catgccgatg cccctgttgg gagcgggtga 7 02 0 
aatgaccgaa acaggcggag gccctgatga 7080 
ctctgaatgg tcagacgaaa gttggtcgac 7140 
ccccccgtac cctaagatac ggggaaagga 7200 
tacaaaaaag aagttgggaa agagtgagtt 7260 
cgtgattagc ttcaaaactg cttctaaagt 7320 
tttcctcaaa caaagatcat tggtgtatgt 7380 
acaaaaagtc actattaata gacaacctct 7440 
attggctaag gaaaaagctt caaaagttgt 7500 
agctcacacg ccctctaagt ctgctaagtc 7560 
tcgttctgga gcagcccgca aggctgttct 7620 
gataccgagt cattatcggc aaactgtgat 7680 
cccccagaaa ccaacaaaga aacccccaag 7740 
atgtgttgag aagatgtact acggtcaggt 7800 
agatgcgtac gggtttgtag atccacgtac 7860 
acccgatgca gtcggagcca catgcgatac 7920 
ggatatcatg gtggagacag acatctactc 7980 
tggcattcac accattgcga ggcagttata 8040 
ccgagagatc ggatatcgta ggtgtaggtc 8100 
cagtttgacc tgctggctga aggtaaatgc 8160 
tcgcttcctt atttgcggcg atgattgcac 8220 
agacaaacaa gcaatgcgtg tctttgctag 8280 
ttgtgtgcct caacccaaat acagtttgga 8340 
ctctggaatt accaaaagtg gcaagcctta 8400 
ccttggcagg tgctctgccg agggtctggg 8460 
tctaatacat cactacccat gtttgtgggt 8520 
gcagatgctc tttgaggaca aacttcccga 8580 
ttatacggtg cctgtagaag atctgcccag 8640 
tttctcggtg gtgcgctaca ccaacgctga 8700 
catgaccatg ccccccctgc gagcctggcg 8760 
caagaggcgt ggcggagcac acgcaaaatt 8820 
tagacctcta ccagatttgg ataagacgag 8880 
tgatgtttac tccccggagg gggatgtgtt 8940 
tcttgtgaag tatttggctg tcattgtttt 9000 
catcagctga acccccaaat tcaaaattaa 9060 
a 99gcagcgg caacagggga gaccccgggc 9120 
ccatggtgga tcagaaccgt ttcgggtgaa 9180 
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gccatggtct gaaggggatg acgtcccttc tggctcatcc acaaaaaccg tctcgggtgg 9240 
gtgaggagtc ctggctgtgt gggaagcagt cagtataatt cccgtcgtgt gtggtgacgc 9300 
ctcacgacgt atttgtccgc tgtgcagagc gtagtaccaa gggctgcacc ccggtttttg 9360 
ttccaagcgg agggcaaccc ccgcttggaa ttaaaaact 9399 



<210> 3 
<211> 48 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
15 <400> 3 

cgggatcccg taatacgact cactatagac cacaaacact ccagtttg 4 8 



<210> 4 
20 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

25 <223> Description of Artificial Sequence: Primer 
<400> 4 

gtggaattca cagcgtcata 20 

30 

<21Q> 5 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 5 

40 tgtgaattcc actctcctac c 21 

<210> 6 
<211> 21 
45 <212> DNA 

<213> Artificial Sequence 



35 



50 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 6 

ttatcgattg cagcaaccat g 21 



55 <210> 7 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
60 <220> 

<223> Description of Artificial Sequence: Primer 
<400> 7 

catggttgct gcaatcgata agctgaagag tacaataac 39 
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<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 8 
10 gacaacagac gcttgacacg 



<210> 9 
<211> 20 
15 <212> DNA 

<213> Artificial Sequence 



20 



35 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 9 

ctgtcatggg agatgcgtac 2 0 



25 <210> 10 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
30 <220> 

<223p Description of Artificial Sequence: Primer 



<400> 10 

cgagctcgag cacatcgcgg ggtcgttaag cccggggtct cc 42 



<210> 11 
<211> 20 
<212> DNA 
40 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
45 <400> 11 

gacaacagac gcttgacacg 20 

<210> 12 
50 <211> 108 
<212> DNA 

<213> Artificial Sequence 
<220> 

55 <223> Description of Artificial Sequence: Primer 
<400> 12 

ccgactcgag aattcggccc tgcaggccac aacagtctcg cgagttttta attccaagcg 60 



60 



ggggttgccc tccgcttgga acaaaaaccg gggtgcagcc cttggtac 108 



<210> 13 
<211> 9599 
<212> DNA 
65 <213> Hepatitis C virus 
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<220> 
<221> CDS 

<222> (342) . . (9377) 
5 <400> 13 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
10 cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 



15 



35 



55 



60 



65 



gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac c atg age acg aat cct 356 

Met Ser Thr Asn Pro 
1 5 



20 aaa cct caa aga aaa acc aaa cgt aac acc aac cgt cgc cca cag gac 
Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gin Asp 
10 15 20 



404 



gtc aag ttc ccg ggt ggc ggt cag ate gtt ggt gga gtt tac ttg ttg 452 
25 Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly Gly Val Tyr Leu Leu 

25 30 35 

ccg cgc agg ggc cct aga ttg ggt gtg cgc gcg acg agg aag act tec 500 
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser 
30 40 45 50 

r 

gag egg teg caa cct cga ggt aga cgt cag cct ate ccc aag gca cgt 54 8 
Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro lie Pro Lys Ala Arg 
55 60 65 



egg ccc gag ggc agg acc tgg get cag ccc ggg tac cct tgg ccc etc 596 
Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu 
70 75 80 85 



40 tat ggc aat gag ggt tgc ggg tgg gcg gga tgg etc ctg tct ccc cgt 644 

Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
90 95 100 

ggc tct egg cct age tgg ggc ccc aca gac ccc egg cgt agg teg cgc 692 

45 Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg 
105 110 115 

aat ttg ggt aag gtc ate gat acc ctt acg tgc ggc ttc gee gac etc 740 

Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
50 120 125 130 



atg ggg tac ata ccg etc gtc ggc gee cct ctt gga ggc get gee agg 788 
Met Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
135 140 145 

gee ctg gcg cat ggc gtc egg gtt ctg gaa gac ggc gtg aac tat gca 83 6 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
150 155 160 165 

aca ggg aac ctt cct ggt tgc tct ttc tct ate ttc ctt ctg gee ctg 884 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
170 175 180 

etc tct tgc ctg act gtg ccc get tea gee tac caa gtg cgc aat tec 932 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gin Val Arg Asn Ser 
185 190 195 
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teg ggg ctt tac cat gtc acc aat gat tgc cct aac teg agt att gtg 980 
Ser Gly Leu Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val 
200 205 210 

tac gag gcg gec gat gec ate ctg cac act ccg ggg tgt gtc cct tgc 102 8 
Tyr Glu Ala Ala Asp Ala He Leu His Thr Pro Gly Cys Val Pro Cys 
215 220 225 

gtt cgc gag ggt aac gec teg agg tgt tgg gtg gcg gtg acc ccc acg 1076 
Val Arg Glu Gly Asn Ala Ser Arg Cys Trp Val Ala Val Thr Pro Thr 
230 235 240 245 

gtg gee acc agg gac ggc aaa etc ccc aca acg cag ctt cga cgt cat 1124 
Val Ala Thr Arg Asp Gly Lys Leu Pro Thr Thr Gin Leu Arg Arg His 
250 255 260 

ate gat ctg ctt gtc ggg age gec acc etc tgc teg gec etc tac gtg 1172 
He Asp Leu Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val 
265 270 275 

ggg gac ctg tgc ggg tct gtc ttt ctt gtt ggt caa ctg ttt acc ttc 1220 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Gly Gin Leu Phe Thr Phe 
280 285 290 

tct ccc agg cgc cac tgg acg acg caa gac tgc aat tgt tct ate tat 1268 
Ser Pro Arg Arg His Trp Thr Thr Gin Asp Cys Asn Cys Ser lie Tyr 
295 300 305 

ccc ggc cat ata acg ggt cat cgc atg gca tgg gat atg atg atg aac 1316 
Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
310 315 320 325 

tgg tec cct acg gca gcg ttg gtg gta get cag ctg etc egg ate cca 1364 
Trp Ser Pro Thr Ala Ala Leu Val Val Ala Gin Leu Leu Arg He Pro 
330 335 340 

caa gee ate atg gac atg ate get ggt get cac tgg gga gtc ctg gcg 1412 
Gin Ala He Met Asp Met He Ala Gly Ala His Trp Gly Val Leu Ala 
345 350 355 

ggc ata gcg tat ttc tec atg gtg ggg aac tgg gcg aag gtc ctg gta 14 60 
Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
360 365 370 

gtg ctg ctg eta ttt gec ggc gtc gac gcg gaa acc cac gtc acc ggg 1508 
Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr His Val Thr Gly 
375 380 385 

gga aat gec ggc cgc acc acg get ggg ctt gtt ggt etc ctt aca cca 1556 
Gly Asn Ala Gly Arg Thr Thr Ala Gly Leu Val Gly Leu Leu Thr Pro 
390 395 400 405 

ggc gec aag cag aac ate caa ctg ate aac acc aac ggc agt tgg cac 1604 
Gly Ala Lys Gin Asn He Gin Leu He Asn Thr Asn Gly Ser Trp His 
410 415 420 

ate aat age acg gee ttg aat tgc aat gaa age ctt aac acc ggc tgg 1652 
He Asn Ser Thr Ala Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp 
425 430 435 

tta gca ggg etc ttc tat caa cac aaa ttc aac tct tea ggc tgt cct 1700 
Leu Ala Gly Leu Phe Tyr Gin His Lys Phe Asn Ser Ser Gly Cys Pro 
440 445 450 

gag agg ttg gee age tgc cga cgc ctt acc gat ttt gee cag ggc tgg 174 8 
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Glu Arg Leu Ala Ser Cys Arg Arg Leu Thr Asp Phe Ala Gin Gly Trp 
455 460 465 

ggt cct ate agt tat gec aac gga age ggc etc gac gaa cgc ccc tac 1796 
Gly Pro He Ser Tyr Ala Asn Gly Ser Gly Leu Asp Glu Arg Pro Tyr 
470 475 480 *~ 485 

tgc tgg cac tac cct cca aga cct tgt ggc att gtg ccc gca aag age 1844 
Cys Trp His Tyr Pro Pro Arg Pro Cys Gly He Val Pro Ala Lys Ser 
490 495 500 

gtg tgt ggc ccg gta tat tgc ttc act ccc age ccc gtg gtg gtg gga 1892 
Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
505 510 515 

acg acc gac agg teg ggc gcg cct ace tac age tgg ggt gca aat gat 1940 
Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser Trp Gly Ala Asn Asp 
520 525 530 

acg gat gtc ttc gtc ctt aac aac acc agg cca ccg ctg ggc aat tgg 1988 
Thr Asp Val Phe Val Leu Asn Asn Thr Arg Pro Pro Leu Gly Asn Trp 
535 540 545 

ttc ggt tgt acc tgg atg aac tea act gga ttc acc aaa gtg tgc gga 2036 
Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Val Cys Gly 
550 555 560 "* 565 

gcg ccc cct tgt gtc ate gga ggg gtg ggc aac aac acc ttg etc tgc 2 084 

Ala Pro Pro Cys Val He Gly Gly Val Gly Asn Asn Thr Leu Leu Cys 
570 575 580 

r 

ccc act gat tgc ttc cgc aaa cat ccg gaa gee aca tac tct egg tgc 2132 

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ser Arg Cys 
585 590 595 

ggc tec ggt ccc tgg att aca ccc agg tgc atg gtc gac tac ccg tat 2180 
Gly Ser Gly Pro Trp He Thr Pro Arg Cys Met Val Asp Tyr Pro Tyr 
600 605 610 

agg ctt tgg cac tat cct tgt acc ate aat tac acc ata ttc aaa gtc 222 8 
Arg Leu Trp His Tyr Pro Cys Thr He Asn Tyr Thr He Phe Lys Val 
615 620 625 

agg atg tac gtg gga ggg gtc gag cac agg ctg gaa gcg gee tgc aac 2276 
Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn 
630 635 640 645 

tgg acg egg ggc gaa cgc tgt gat ctg gaa gac agg gac agg tec gag 2324 
Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
650 655 660 

etc age ccg ttg ctg ctg tec acc aca cag tgg cag gtc ctt ccg tgt 2372 
Leu Ser Pro Leu Leu Leu Ser Thr Thr Gin Trp Gin Val Leu Pro Cys 
665 670 675 

tct ttc acg acc ctg cca gee ttg tec acc ggc etc ate cac etc cac 2420 
Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His Leu His 
680 685 690 

cag aac att gtg gac gtg cag tac ttg tac ggg gta ggg tea age ate 2468 
Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ser He 
695 700 705 

gcg tec tgg gee att aag tgg gag tac gtc gtt etc ctg ttc ctt ctg 2516 
Ala Ser Trp Ala He Lys Trp Glu Tyr Val Val Leu Leu Phe Leu Leu 
710 715 720 725 
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ctt gca gac gcg cgc gtc tgc tec tgc ttg tgg atg atg tta etc ata 2564 
Leu Ala Asp Ala Arg Val Cys Ser Cys Leu Trp Met Met Leu Leu He 
730 735 740 

tec caa gcg gag gcg get ttg gag aac etc gta ata etc aat gca gca 2612 
Ser Gin Ala Glu Ala Ala Leu Glu Asn Leu Val He Leu Asn Ala Ala 
745 750 755 



tec ctg gee ggg acg cac ggt ctt gtg tec ttc etc gtg ttc ttc tgc 
Ser Leu Ala Gly Thr His Gly Leu Val Ser Phe Leu Val Phe Phe Cys 
760 765 770 



2660 



ttt gcg tgg tat ctg aag ggt agg tgg gtg ccc gga gcg gtc tac gee 2708 
Phe Ala Trp Tyr Leu Lys Gly Arg Trp Val Pro Gly Ala Val Tyr Ala 
775 780 785 

etc tac ggg atg tgg cct etc etc ctg etc ctg ctg gcg ttg cct cag 2756 
Leu Tyr Gly Met Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Gin 
790 795 800 805 

egg gca tac gca ctg gac acg gag gtg gee gcg teg tgt ggc ggc gtt 2 804 
Arg Ala Tyr Ala Leu Asp Thr Glu Val Ala Ala Ser Cys Gly Gly Val 
810 815 820 

gtt ctt gtc ggg tta atg gcg ctg act ctg teg cca tat tac aag cgc 2 852 
Val Leu Val Gly Leu Met Ala Leu Thr Leu Ser Pro Tyr Tyr Lys Arg 
825 830 835 

tat ate age tgg tgc atg tgg tgg ctt cag tat ttt ctg ace aga gta 2900 
Tyr £le Ser Trp Cys Met Trp Trp Leu Gin Tyr Phe Leu Thr Arg Val 
840 845 850 

gaa gcg caa ctg cac gtg tgg gtt ccc ccc etc aac gtc egg ggg ggg 294 8 
Glu Ala Gin Leu His Val Trp Val Pro Pro Leu Asn Val Arg Gly Gly 
855 860 865 

cgc gat gee gtc ate tta etc atg tgt gta gta cac ccg ace ctg gta 2996 
Arg Asp Ala Val He Leu Leu Met Cys Val Val His Pro Thr Leu Val 
870 875 880 885 

ttt gac ate ace aaa eta etc ctg gec ate ttc gga ccc ctt tgg att 3044 
Phe Asp He Thr Lys Leu Leu Leu Ala He Phe Gly Pro Leu Trp He 
890 895 ~ 900 

ctt caa gee agt ttg ctt aaa gtc ccc tac ttc gtg cgc gtt caa ggc 3092 
Leu Gin Ala Ser Leu Leu Lys Val Pro Tyr Phe Val Arg Val Gin Gly 
905 910 915 

ctt etc egg ate tgc gcg eta gcg egg aag ata gee gga ggt cat tac 3140 
Leu Leu Arg He Cys Ala Leu Ala Arg Lys He Ala Gly Gly His Tyr 
920 925 930 

gtg caa atg gec ate ate aag tta ggg gcg ctt act ggc acc tat gtg 3188 
Val Gin Met Ala He He Lys Leu Gly Ala Leu Thr Gly Thr Tyr Val 
935 940 945 

tat aac cat etc acc cct ctt cga gac tgg gcg cac aac ggc ctg cga 3236 
Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Ala His Asn Gly Leu Arq 
950 955 960 965 

gat ctg gec gtg get gtg gaa cca gtc gtc ttc tec cga atg gag acc 3284 
Asp Leu Ala Val Ala Val Glu Pro Val Val Phe Ser Arg Met Glu Thr 
970 975 980 

aag etc ate acg tgg ggg gca gat acc gee gcg tgc ggt gac ate ate 3332 
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Lys Leu lie Thr Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp lie lie 
985 990 995 

aac ggc ttg ccc gtc tct gcc cgt agg ggc cag gag ata ctg ctt ggg 33 80 
Asn Gly Leu Pro Val Ser Ala Arg Arg Gly Gin Glu He Leu Leu Gly 
1000 1005 1010 

cca gcc gac gga atg gtc tec aag ggg tgg agg ttg ctg gcg ccc ate 3428 
Pro Ala Asp Gly Met Val Ser Lys Gly Trp Arg Leu Leu Ala Pro He 
1015 1020 1025 

acg gcg tac gcc cag cag acg aga ggc etc eta ggg tgt ata ate acc 3476 
Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys He He Thr 
1030 1035 * 1040 " 1045 

age ctg act ggc egg gac aaa aac caa gtg gag ggt gag gtc cag ate 3524 
Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val Gin He 
1050 1055 1060 

gtg tea act get acc caa acc ttc ctg gca acg tgc ate aat ggg gta 3572 
Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys He Asn Gly Val 
1065 1070 1075 

tgc tgg act gtc tac cac ggg gcc gga acg agg acc ate gca tea ccc 3620 
Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr He Ala Ser Pro 
1080 1085 1090 

aa 9 ggt cct gtc ate cag atg tat acc aat gtg gac caa gac ctt gtg 3668 
Lys Gly Pro Val He Gin Met Tyr Thr Asn Val Asp Gin Asp Leu Val 
? 1095 1100 1105 

ggc tgg ccc get cct caa ggt tec cgc tea ttg aca ccc tgt acc tgc 3716 
Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser Leu Thr Pro Cys Thr Cys 
1110 1115 1120 " 1125 

ggc tec teg gac ctt tac ctg gtc acg agg cac gcc gat gtc att ccc 3764 
Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val He Pro 
1130 1135 1140 

gtg cgc egg cga ggt gat age agg ggt age ctg ctt teg ccc egg ccc 3 812 
Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro 
1145 1150 1155 

att tec tac ttg aaa ggc tec teg ggg ggt ccg ctg ttg tgc ccc gcg 3860 
He Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala 
1160 1165 1170 

gga cac gcc gtg ggc eta ttc agg gcc gcg gtg tgc acc cgt gga gtg 3908 
Gly His Ala Val Gly Leu Phe Arg Ala Ala Val Cys Thr Arg Gly Val 
1175 1180 1185 

get aaa gcg gtg gac ttt ate cct gtg gag aac eta ggg aca acc atg 3956 
Ala Lys Ala Val Asp Phe He Pro Val Glu Asn Leu Gly Thr Thr Met 
1190 1195 1200 1205 

aga tec ccg gtg ttc acg gac aac tec tct cca cca gca gtg ccc cag 4004 
Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro Ala Val Pro Gin 
1210 1215 1220 

age ttc cag gtg gcc cac ctg cat get ccc acc ggc age ggt aag age 4052 
Ser Phe Gin Val Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser 
1225 1230 1235 



acc aag gtc ccg get gcg tac gca gcc cag ggc tac aag gtg ttg gtg 
Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val 
1240 1245 1250 



4100 
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etc aac ccc tct gtt get gca acg ctg ggc ttt ggt get tac atg tec 4148 
Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser 
1255 1260 1265 

aag gee cat ggg gtt gat cct aat ate agg acc ggg gtg aga aca att 4196 
Lys Ala His Gly Val Asp Pro Asn He Arg Thr Gly Val Arg Thr He 
1270 1275 1280 1285 

acc act ggc age ccc ate acg tac tec acc tac ggc aag ttc ctt gee 4244 
Thr Thr Gly Ser Pro He Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala 
1290 1295 1300 

gac ggc ggg tgc tea gga ggt get tat gac ata ata att tgt gac gag 4292 
Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He He Cys Asp Glu 
1305 1310 1315 

tgc cac tec acg gat gee aca tec ate ttg ggc ate ggc act gtc ctt 4340 
Cys His Ser Thr Asp Ala Thr Ser He Leu Gly He Gly Thr Val Leu 
1320 1325 1330 



gac caa gca gag act gcg ggg gcg aga ctg gtt gtg etc gee act get 43 8 8 
Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala 
1335 1340 1345 

acc cct ccg ggc tec gtc act gtg tec cat cct aac ate gag gag gtt 4436 
Thr Pro Pro Gly Ser Val Thr Val Ser His Pro Asn He Glu Glu Val 
1350 1355 1360 1365 

get ctg tec acc acc gga gag ate ccc ttt tac ggc aag get ate ccc 4484 
Ala fceu Ser Thr Thr Gly Glu He Pro Phe Tyr Gly Lys Ala He Pro 
1370 1375 " 1380 

etc gag gtg ate aag ggg gga aga cat etc ate ttc tgc cac tea aag 4532 
Leu Glu Val He Lys Gly Gly Arg His Leu He Phe Cys His Ser Lys 
1385 1390 1395 

aag aag tgc gac gag etc gee gcg aag ctg gtc gca ttg ggc ate aat 4580 
Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly He Asn 
1400 1405 1410 

gec gtg gee tac tac cgc ggt ctt gac gtg tct gtc ate ccg acc age 4628 
Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser 
1415 1420 1425 

ggc gat gtt gtc gtc gtg teg acc gat get etc atg act ggc ttt acc 4676 
Gly Asp Val Val Val Val Ser Thr Asp Ala Leu Met Thr Gly Phe Thr 
^30 1435 1440 J 1445 

ggc gac ttc gac tct gtg ata gac tgc aac acg tgt gtc act cag aca 4724 
Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr Cys Val Thr Gin Thr 
1450 1455 1460 

gtc gat ttc age ctt gac cct acc ttt acc att gag aca acc acg etc 4772 
Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu Thr Thr Thr Leu 
1465 1470 1475 

ccc cag gat get gtc tec agg act caa cgc egg ggc agg act ggc agg 4820 
Pro Gin Asp Ala Val Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg 
1480 1485 1490 

ggg aag cca ggc ate tat aga ttt gtg gca ccg ggg gag cgc ccc tec 4868 
Gly Lys Pro Gly He Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser 
1495 1500 1505 

ggc atg ttc gac teg tec gtc etc tgt gag tgc tat gac gcg ggc tgt 4916 
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Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys 
1510 1515 1520 1525 



get tgg tat gag etc acg ccc gec gag act aca gtt agg eta cga gcg 
Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala 
1530 1535 1540 



4964 



tac atg aac acc ccg ggg ctt ccc gtg tgc cag gac cat ctt gaa ttt 
Tyr Met Asn Thr Pro Gly Leu Pro Val Cys Gin Asp His Leu Glu Phe 
1545 1550 1555 



5012 



tgg gag ggc gtc ttt acg ggc etc act cat ata gat gec cac ttt tta 
Trp Glu Gly Val Phe Thr Gly Leu Thr His lie Asp Ala His Phe Leu 
1560 1565 1570 



5060 



tec cag aca aag cag agt ggg gag aac ttt cct tac ctg gta gcg tac 
Ser Gin Thr Lys Gin Ser Gly Glu Asn Phe Pro Tyr Leu Val Ala Tyr 
1575 1580 1585 



5108 



caa gec acc gtg tgc get agg get caa gee cct ccc cca teg tgg gac 
Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp 
1590 1595 1600 1605 



5156 



cag atg tgg aag tgt ttg ate cgc ctt aaa ccc acc etc cat ggg cca 
Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro Thr Leu His Gly Pro 
1610 1615 1620 



5204 



aca ccc ctg eta tac aga ctg ggc get gtt cag aat gaa gtc acc ctg 5252 

Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn Glu Val Thr Leu 
1625 1630 1635 

? 

acg cac cca ate acc aaa tac ate atg aca tgc atg teg gec gac ctg 5300 

Thr His Pro He Thr Lys Tyr He Met Thr Cys Met Ser Ala Asp Leu 
1640 1645 1650 



gag gtc gtc acg age acc tgg gtg etc gtt ggc ggc gtc ctg get get 
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala 
1655 1660 1665 



5348 



ctg gee gcg tat tgc ctg tea aca ggc tgc gtg gtc ata gtg ggc agg 
Leu Ala Ala Tyr Cys Leu Ser Thr Gly Cys Val Val He Val Gly Arg 
1670 1675 1680 1685 



5396 



ate gtc ttg tec ggg aag ccg gca att ata cct gac agg gag gtt etc 
He Val Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu 
1690 1695 " 1700 



5444 



tac cag gag ttc gat gag atg gaa gag tgc tct cag Cac tta ccg tac 
Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ser Gin His Leu Pro Tyr 
1705 1710 1715 



5492 



ate gag caa ggg atg atg etc get gag cag ttc aag cag aag gec etc 
He Glu Gin Gly Met Met Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu 
1720 1725 " 1730 



5540 



ggc etc ctg cag acc gcg tec cgc cat gca gag gtt ate acc cct get 
Gly Leu Leu Gin Thr Ala Ser Arg His Ala Glu Val He Thr Pro Ala 
1735 1740 1745 



5588 



gtc cag acc aac tgg cag aaa etc gag gtc ttt tgg gcg aag cac atg 
Val Gin Thr Asn Trp Gin Lys Leu Glu Val Phe Trp Ala Lys His Met 
1750 1755 1760 1765 



5636 



tgg aat ttc ate agt ggg ata caa tac ttg gcg ggc ctg tea acg ctg 
Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly Leu Ser Thr Leu 
1770 1775 1780 



5684 
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cct ggt aac ccc gcc att get tea ttg atg get ttt aca get gee gtc 5732 
Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala Phe Thr Ala Ala Val 
1785 1790 1795 

acc age cca eta ace act ggc caa ace etc etc ttc aac ata ttg ggg 5780 
Thr Ser Pro Leu Thr Thr Gly Gin Thr Leu Leu Phe Asn lie Leu Gly 
1800 1805 1810 

999 tgg gtg get gcc cag etc gcc gcc ccc ggt gcc get act gcc ttt 5828 
Gly Trp Val Ala Ala Gin Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe 
1815 1820 1825 

gtg ggt get ggc eta get ggc gcc gcc ate ggc age gtt gga ctg ggg 5876 
Val Gly Ala Gly Leu Ala Gly Ala Ala lie Gly Ser Val Gly Leu Gly 
1830 1835 1840 1845 

aag gtc etc gtg gac att ctt gca ggg tat ggc gcg ggc gtg gcg gga 5924 
Lys Val Leu Val Asp lie Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly 
1850 1855 "* I860 

get ctt gta gca ttc aag ate atg age ggt gag gtc ccc tec acg gag 5972 
Ala Leu Val Ala Phe Lys lie Met Ser Gly Glu Val Pro Ser Thr Glu 
1865 1870 1875 

gac ctg gtc aat ctg ctg ccc gcc ate etc teg cct gga gcc ctt gta 6020 
Asp Leu Val Asn Leu Leu Pro Ala lie Leu Ser Pro Gly Ala Leu Val 
1880 1885 1890 

gtc ggt gtg gtc tgc gca gca ata ctg cgc egg cac gtt ggc ccg ggc 6068 
Val &ly Val Val Cys Ala Ala He Leu Arg Arg His Val Gly Pro Gly 
1895 1900 1905 

gag ggg gca gtg caa tgg atg aac egg eta ata gcc ttc gcc tec egg 6116 
Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala Phe Ala Ser Arg 
1910 1915 1920 1925 

ggg aac cat gtt tec ccc acg cac tac gtg ccg gag age gat gca gcc 6164 
Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala 
1930 1935 1940 

gcc cgc gtc act gcc ata etc age age etc act gta acc cag etc ctg 6212 
Ala Arg Val Thr Ala He Leu Ser Ser Leu Thr Val Thr Gin Leu Leu 
1945 1950 1955 

agg cga ctg cat cag tgg ata age teg gag tgt acc act cca tgc tec 6260 
Arg Arg Leu His Gin Trp He Ser Ser Glu Cys Thr Thr Pro Cys Ser 
I960 1965 1970 

ggt tec tgg eta agg gac ate tgg gac tgg ata tgc gag gtg ctg age 6308 
Gly Ser Trp Leu Arg Asp He Trp Asp Trp He Cys Glu Val Leu Ser 
1975 1980 1985 

gac ttt aag acc tgg ctg aaa gcc aag etc atg cca caa ctg cct ggg 6356 
Asp Phe Lys Thr Trp Leu Lys Ala Lys Leu Met Pro Gin Leu Pro Gly 
1990 1995 2000 2005 

att ccc ttt gtg tec tgc cag cgc ggg tat agg ggg gtc tgg cga gga 6404 
He Pro Phe Val Ser Cys Gin Arg Gly Tyr Arg Gly Val Trp Arg Gly 
2010 2015 2020 

gac ggc att atg cac act cgc tgc cac tgt gga get gag ate act gga 6452 
Asp Gly He Met His Thr Arg Cys His Cys Gly Ala Glu He Thr Gly 
2025 2030 ~ 2035 

cat gtc aaa aac ggg acg atg agg ate gtc ggt cct agg acc tgc agg 6500 
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His Val Lys Asn Gly Thr Met Arg lie Val Gly Pro Arg Thr Cys Arg 
2040 2045 2050 



aac atg tgg agt ggg acg ttc ccc att aac gcc tac acc acg ggc ccc 
Asn Met Trp Ser Gly Thr Phe Pro lie Asn Ala Tyr Thr Thr Gly Pro 
2055 2060 2065 



6548 



tgt act ccc ctt cct gcg ccg aac tat aag ttc gcg ctg tgg agg gtg 
Cys Thr Pro Leu Pro Ala Pro Asn Tyr Lys Phe Ala Leu Trp Arg Val 
2070 2075 2080 2085 



6596 



tct gca gag gaa tac gtg gag ata agg egg gtg ggg gac ttc cac tac 
Ser Ala Glu Glu Tyr Val Glu lie Arg Arg Val Gly Asp Phe His Tyr 
2090 2095 2100 



6644 



gta teg ggt atg act act gac aat ctt aaa tgc ccg tgc cag ate cca 
Val Ser Gly Met Thr Thr Asp Asn Leu Lys Cys Pro Cys Gin lie Pro 
2105 2110 2115 



6692 



teg ccc gaa ttt ttc aca gaa ttg gac ggg gtg cgc eta cac agg ttt 
Ser Pro Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe 
2120 2125 2130 



6740 



gcg ccc cct tgc aag ccc ttg ctg egg gag gag gta tea ttc aga gta 
Ala Pro Pro Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val 
2135 2140 2145 



6788 



gga etc cac gag tac ccg gtg ggg teg caa tta cct tgc gag ccc gaa 
Gly Leu His Glu Tyr Pro Val Gly Ser Gin Leu Pro Cys Glu Pro Glu 
2150 2155 2160 2165 



6836 



ccg gac gta gcc gtg ttg acg tec atg etc act gat ccc tec cat ata 
Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His lie 
2170 2175 2180 



6884 



aca gca gag gcg gcc ggg aga agg ttg gcg aga ggg tea ccc cct tct 
Thr Ala Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser 
2185 2190 2195 



6932 



atg gcc age tec teg get age cag ctg tec get cca tct etc aag gca 
Met Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys Ala 
2200 2205 2210 



6980 



act tgc acc gcc aac cat gac tec cct gac gcc gag etc ata gag get 
Thr Cys Thr Ala Asn His Asp Ser Pro Asp Ala Glu Leu lie Glu Ala 
2215 2220 2225 



7028 



aac etc ctg tgg agg cag gag atg ggc ggc aac ate acc agg gtt gag 7076 
Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val Glu 
2230 2235 2240 2245 

tea gag aac aaa gtg gtg att ctg gac tec ttc gat ccg ctt gtg gca 7124 
Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Asp Pro Leu Val Ala 
2250 2255 2260 



gag gag gat gag egg gag gtc tec gta cct gca gaa att ctg egg aag 
Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg Lys 
2265 2270 2275 



7172 



tct egg aga ttc gcc egg gcc ctg ccc gtc tgg gcg egg ccg gac tac 7220 
Ser Arg Arg Phe Ala Arg Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 

aac ccc ccg eta gta gag acg tgg aaa aag cct gac tac gaa cca cct 72 68 
Asn Pro Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro 
2295 2300 " 2305 



14 



WO 2004/005498 



PCT/US2003/021002 



gtg gtc cat ggc tgc ccg eta cca cct cca egg tec cct cct gtg cct 
Val Val His Gly Cys Pro Leu Pro Pro Pro Arg Ser Pro Pro Val Pro 
2310 2315 2320 2325 



7316 



ccg cct egg aaa aag cgt acg gtg gtc etc ace gaa tea ace eta tct 
Pro Pro Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser 
2330 2335 2340 



7364 



act gee ttg gee gag ctt gee acc aaa agt ttt ggc age tec tea act 
Thr Ala Leu Ala Glu Leu Ala Thr Lys Ser Phe Gly Ser Ser Ser Thr 
2345 2350 2355 



7412 



tec ggc att acg ggc gac aat acg aca aca tec tct gag ccc gee cct 
Ser Gly lie Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro 
2360 2365 2370 



7460 



tct ggc tgc ccc ccc gac tec gac gtt gag tec tat tct tec atg ccc 
Ser Gly Cys Pro Pro Asp Ser Asp Val Glu Ser Tyr Ser Ser Met Pro 
2375 2380 2385 



7508 



ccc ctg gag ggg gag cct ggg gat ccg gat etc age gac ggg tea tgg 
Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp 
2390 2395 2400 ^ "* 2405 



7556 



teg acg gtc agt agt ggg gee gac acg gaa gat gtc gtg tgc tgc tea 7604 

Ser Thr Val Ser Ser Gly Ala Asp Thr Glu Asp Val Val Cys Cys Ser 
2410 2415 2420 

atg tct tat tec tgg aca ggc gca etc gtc acc ccg tgc get gcg gaa 7652 

Met £er Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu 
2425 2430 2435 



gaa caa aaa ctg ccc ate aac gca ctg age aac teg ttg eta cgc cat 
Glu Gin Lys Leu Pro lie Asn Ala Leu Ser Asn Ser Leu Leu Arg His 
2440 2445 2450 



7700 



cac aat ctg gtg tat tec acc act tea cgc agt get tgc caa agg cag 
His Asn Leu Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gin Arg Gin 
2455 2460 2465 



7748 



aag aaa gtc aca ttt gac aga ctg caa gtt ctg gac age cat tac cag 7796 

Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Ser His Tyr Gin 
2470 2475 2480 *" 2485 

gac gtg etc aag gag gtc aaa gca gcg gcg tea aaa gtg aag get aac 7844 

Asp Val Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn 
2490 2495 * 2500 

ttg eta tec gta gag gaa get tgc age ctg acg ccc cca cat tea gee 7892 

Leu Leu Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala 
2505 2510 2515 



aaa tec aag ttt ggc tat ggg gca aaa gac gtc cgt tgc cat gee aga 
Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg 
2520 2525 2530 



7940 



aag gee gta gee cac ate aac tec gtg tgg aaa gac ctt ctg gaa gac 7988 

Lys Ala Val Ala His lie Asn Ser Val Trp Lys Asp Leu Leu Glu Asp 

2535 2540 2545 

agt gta aca cca ata gac act acc ate atg gee aag aac gag gtt ttc 8036 

Ser Val Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn Glu Val Phe 

2550 2555 2560 2565 

tgc gtt cag cct gag aag ggg ggt cgt aag cca get cgt etc ate gtg 8084 
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Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro 
50 2760 2765 



55 



60 



8132 



8180 



Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu He Val 
2570 2575 2580 

ttc ccc gac ctg ggc gtg cgc gtg tgc gag aag atg gcc ctg tac gac 
*> Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp 
2585 2590 2595 

gtg gtt age aag etc ccc ctg gcc gtg atg gga age tec tac gga ttc 
Val Val Ser Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe 
10 2600 2605 2610 

caa tac tea cca gga cag egg gtt gaa ttc etc gtg caa gcg tgg aag 8228 
Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Gin Ala Trp Lys 
2615 2620 2625 

tec aag aag ace ccg atg ggg ttc teg tat gat acc cgc tgt ttt gac 8276 
Ser Lys Lys Thr Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp 
2"0 2635 2640 ~ 2645 

tec aca gtc act gag age gac ate cgt acg gag gag gca att tac caa 
Ser Thr Val Thr Glu Ser Asp He Arg Thr Glu Glu Ala lie Tyr Gin 
2650 2655 2660 

tgt tgt gac ctg gac ccc caa gcc cgc gtg gcc ate aag tec etc act 
Cys Cys Asp Leu Asp Pro Gin Ala Arg Val Ala He Lys Ser Leu Thr 
2665 2670 2675 

gag agg ctt tat gtt ggg ggc cct ctt acc aat tea agg ggg gaa aac 842 0 
Glu Arg Leu Tyr Val Gly Gly Pro Leu Thr Asn Ser Arg Gly Glu Asn 
JU ? 2680 2685 2690 



8324 



8372 



tgc ggc tac cgc agg tgc cgc gcg age ggc gta ctg aca act age tgt 8468 
Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys 
2695 2700 2705 

ggt aac acc etc act tgc tac ate aag gcc egg gca gcc tgt cga gcc 8516 
Gly Asn Thr Leu Thr Cys Tyr He Lys Ala Arg Ala Ala Cys Arg Ala 
2710 2715 2720 2725 

gca ggg etc cag gac tgc acc atg etc gtg tgt ggc gac gac tta gtc 8564 
Ala Gly Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 
2730 2735 ' 2740 

gtt ate tgt gaa agt gcg ggg gtc cag gag gac gcg gcg age ctg aga 8612 
Val He Cys Glu Ser Ala Gly Val Gin Glu Asp Ala Ala Ser Leu Arg 
2745 2750 2755 



gcc ttc acg gag get atg acc agg tac tec gcc ccc ccc ggg gac ccc 8660 

2770 



cca caa cca gaa tac gac ttg gag ctt ata aca tea tgc tec tec aac 8708 
Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn 
2775 2780 2785 

gtg tea gtc gcc cac gac ggc get gga aag agg gtc tac tac ctt acc 8756 
Val Ser Val Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr 
2790 2795 2800 ~ 2805 

cgt gac cct aca acc ccc etc gcg aga gcc gcg tgg gag aca gca aga 8804 
Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg 
2810 2815 2820 

cac act cca gtc aat tec tgg eta ggc aac ata ate atg ttt gcc ccc 8852 
CO His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Phe Ala Pro 
2825 2830 2835 
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65 



aca ctg tgg gcg agg atg ata ctg atg acc cat ttc ttt age gtc etc 8900 
Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser Val Leu 
2840 2845 2850 

ata gec agg gat cag ctt gaa cag get ctt aac tgt gag ate tac gga 894 8 
He Ala Arg Asp Gin Leu Glu Gin Ala Leu Asn Cys Glu He Tyr Gly 
2855 2860 2865 



gee tgc tac tec ata gaa cca ctg gat eta cct cca ate att caa aga 
Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Pro He He Gin Arg 
2870 2875 2880 2885 

etc cat ggc etc age gca ttt tea etc cac agt tac tct cca ggt gaa 
Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly Glu 
2890 2895 2900 

ate aat agg gtg gee gca tgc etc aga aaa ctt ggg gtc ccg ccc ttg 
He Asn Arg Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro Pro Leu 
20 2905 2910 2915 



8996 



9044 



9092 



9140 



9188 



cga get tgg aga cac egg gee egg age gtc cgc get agg ctt ctg tec 
Arg Ala Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu Ser 
2920 2925 2930 

aga gga ggc agg get gee ata tgt ggc aag tac etc ttc aac tgg gca 
Arg Gly Gly Arg Ala Ala He Cys Gly Lys Tyr Leu Phe Asn Trp Ala 
2935 2940 2945 

gta aga aca aag etc aaa etc act cca ata gcg gee get ggc egg ctg 923 6 
Val ^rg Thr Lys Leu Lys Leu Thr Pro He Ala Ala Ala Gly Arg Leu 
295 ° 2955 2960 2965 



gac ttg tec ggt tgg ttc acg get ggc tac age ggg gga gac att tat 
Asp Leu Ser Gly Trp Phe Thr Ala Gly Tyr Ser Gly Gly Asp He Tyr 
2970 2975 " 2980 

cac age gtg tct cat gee egg ccc cgc tgg ttc tgg ttt tgc eta etc 
His Ser Val Ser His Ala Arg Pro Arg Trp Phe Trp Phe Cys Leu Leu 
4U 2985 2990 2995 



9284 



9332 



ctg etc get gca ggg gta ggc ate tac etc etc ccc aac cga tga 9377 
Leu Leu Ala Ala Gly Val Gly. He Tyr Leu Leu Pro Asn Arg 
3000 3005 3010 

aggttggggt aaacactccg gectcttaag ccatttcctg tttttttttt tttttttttt 9437 

ttttttttct tttttttttt ctttcctttc cttctttttt tcctttcttt ttcccttctt 9497 

taatggtggc tccatcttag ccctagtcac ggctagctgt gaaaggtccg tgagcegcat 9557 

gaetgeagag agtgctgata ctggcctctc tgcagatcat gt 9599 



55 <210> 14 

<211> 3011 
<212> PRT 

<213> Hepatitis C virus 
60 <400> 14 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

1 5 10 is 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 
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25 



Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 ~ 80 

5 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

85 90 ~ 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
10 115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 

130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 

145 _ 150 155 160 

15 Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

180 185 190 

Gln Va l Arg Asn Ser Ser Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
20 195 200 205 

Asn Ser Ser He Val Tyr Glu Ala Ala Asp Ala He Leu His Thr Pro 

210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser Arg Cys Trp Val 
225 230 235 240 

Ala Val Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Thr Thr 

245 250 * 255 

Gin Leu Arg Arg His He Asp Leu Leu Val Gly Ser Ala Thr Leu Cys 

260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Gly 
30 275 280 285 

G(Ln Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Asp Cys 

290 . 295 300 

Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
„ 305 310 315 ~ 320 

3D Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ala Gin 

325 330 335 

Leu Leu Arg He Pro Gin Ala He Met Asp Met He Ala Gly Ala His 

340 345 350 

Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp 
40 355 360 365 

Ala Lys Val Leu Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu 

370 375 380 

Thr His Val Thr Gly Gly Asn Ala Gly Arg Thr Thr Ala Gly Leu Val 
A . 385 390 395 400 

4!> Gly Leu Leu Thr Pro Gly Ala Lys Gin Asn He Gin Leu He Asn Thr 

405 410 415 

Asn Gly Ser Trp His He Asn Ser Thr Ala Leu Asn Cys Asn Glu Ser 

420 425 430 

Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr Gin His Lys Phe Asn 
MJ 435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Arg Leu Thr Asp 

450 455 4 60 

Phe Ala Gin Gly Trp Gly Pro He Ser Tyr Ala Asn Gly Ser Gly Leu 
465 470 475 ^ 480 

Asp Glu Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Pro Cys Gly He 

485 490 "* 495 

Val Pro Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 

500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr Tvr Ser 
60 515 520 ~ 525 

Trp Gly Ala Asn Asp Thr Asp Val Phe Val Leu Asn Asn Thr Arg Pro 
530 535 540 

Pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe 
545 550 555 560 

Thr Lys Val Cys Gly Ala Pro Pro Cys Val He Gly Gly Val Gly Asn 
565 570 * 575 
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Asn Thr Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 

580 585 " 590 

Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp lie Thr Pro Arg Cys Met 

595 600 605 

Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr lie Asn Tyr 

610 615 620 

Thr lie Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 

645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gin Trp 

660 665 670 

Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 

675 680 685 

Leu lie His Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly 

690 695 700 

Val Gly Ser Ser lie Ala Ser Trp Ala He Lys Trp Glu Tyr Val Val 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ser Cys Leu Trp 

725 730 735 

Met Met Leu Leu He Ser Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 

740 745 750 

He Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly Leu Val Ser Phe 

755 760 765 

Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly Arg Trp Val Pro 

770> 775 780 

Gly Ala Val Tyr Ala Leu Tyr Gly Met Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu Asp Thr Glu Val Ala Ala 

805 810 815 

Ser pys Gly Gly Val Val Leu Val Gly Leu Met Ala Leu Thr Leu Ser 

820 825 830 

Pro Tyr Tyr Lys Arg Tyr He Ser Trp Cys Met Trp Trp Leu Gin Tyr 

835 840 845 

Phe Leu Thr Arg Val Glu Ala Gin Leu His Val Trp Val Pro Pro Leu 

850 855 860 

Asn Val Arg Gly Gly Arg Asp Ala Val He Leu Leu Met Cys Val Val 
86 5 870 875 880 

His. Pro Thr Leu Val Phe Asp He Thr Lys Leu Leu Leu Ala He Phe 

885 890 895 

Gly Pro Leu Trp He Leu Gin Ala Ser Leu Leu Lys Val Pro Tyr Phe 

900 905 910 

Val Arg Val Gin Gly Leu Leu Arg He Cys Ala Leu Ala Arg Lys He 

915 920 925 

Ala Gly Gly His Tyr Val Gin Met Ala He He Lys Leu Gly Ala Leu 

930 935 940 

Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Ala 
945 950 955 ~ 960 

His Asn Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe 

965 970 975 

Ser Arg Met Glu Thr Lys Leu He Thr Trp Gly Ala Asp Thr Ala Ala 

980 985 990 

Cys Gly Asp He He Asn Gly Leu Pro Val Ser Ala Arg Arg Gly Gin 

995 1000 1005 

Glu He Leu Leu Gly Pro Ala Asp Gly Met Val Ser Lys Gly Trp Arg 

1010 1015 1020 

Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu 
1025 1030 1035 1040 

Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 

1045 1050 1055 

Gly Glu Val Gin He Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr 

1060 1065 1070 

Cys He Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg 

1075 1080 1085 

Thr He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr Thr Asn Val 
1090 1095 iioo 
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Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser Leu 
1105 1110 1115 1120 

Thx Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His 

1125 1130 1135 

Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu 

1140 1145 " 1150 

Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro 

1155 1160 ^ 1165 

Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu Phe Arg Ala Ala Val 

1170 1175 1180 

Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe lie Pro Val Glu Asn 
1185 1190 1195 1200 

Leu Gly Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro 

1205 1210 1215 

Pro Ala Val Pro Gin Ser Phe Gin Val Ala His Leu His Ala Pro Thr 

1220 1225 1230 

Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly 

1235 1240 1245 

Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe 

1250 1255 1260 

Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro Asn lie Arg Thr 
1265 1270 1275 1280 

Gly Val Arg Thr lie Thr Thr Gly Ser Pro lie Thr Tyr Ser Thr Tyr 

1285 1290 1295 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He 

1300 1305 1310 

He He Cys Asp Glu Cys His Ser Thr Asp Ala Thr Ser He Leu Gly 

1315 1320 1325 

He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 

1330 1335 1340 

Val /Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Ser His Pro 
1345 1350 1355 1360 

Asn He Glu Glu Val Ala Leu Ser Thr Thr Gly Glu He Pro Phe Tyr 

1365 1370 1375 

Gly Lys Ala He Pro Leu Glu Val He Lys Gly Gly Arg His Leu He 

1380 1385 " * 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Val 

1395 1400 1405 

Ala Leu Gly He Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser 

1410 1415 1420 

Val He Pro Thr Ser Gly Asp Val Val Val Val Ser Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr 

1445 1450 1455 

Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He 

1460 1465 1470 

Glu Thr Thr Thr Leu Pro Gin Asp Ala Val Ser Arg Thr Gin Arg Arg 

1475 1480 1485 

Gly Arg Thr Gly Arg Gly Lys Pro Gly He Tyr Arg Phe Val Ala Pro 

1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys 
1505 1510 1515 1520 

Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr 

1525 1530 1535 

Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly Leu Pro Val Cys Gin 

1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr His He 

1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly Glu Asn Phe Pro 

1570 1575 1580 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 ^ 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro 

1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin 
1620 1625 1630 
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Asn Glu Val Thr Leu Thr His Pro lie Thr Lys Tyr lie Met Thr Cys 

1635 1640 1645 

Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly 

1650 1655 1660 

Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Thr Gly Cys Val 
1665 1670 1675 1680 

Val He Val Gly Arg He Val Leu Ser Gly Lys Pro Ala He He Pro 

1685 1690 1695 

Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ser 

1700 1705 1710 

Gin His Leu Pro Tyr He Glu Gin Gly Met Met Leu Ala Glu Gin Phe 

1715 1720 1725 

Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser Arg His Ala Glu 

1730 1735 1740 

Val He Thr Pro Ala Val Gin Thr Asn Trp Gin Lys Leu Glu Val Phe 
1745 1750 1755 1760 

Trp Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala 

1765 1770 1775 

Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala 

1780 1785 1790 

Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Gly Gin Thr Leu Leu 

1795 1800 1805 

Phe Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Ala Pro Gly 

1810 1815 1820 

Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala Gly Ala Ala He Gly 
1825 1830 1835 1840 

Ser Val Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly 

1845 1850 1855 

Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu 

1860 1865 1870 

Val ?Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser 

1875 1880 1885 

Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg 

1890 1895 1900 

His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He 
1905 1910 1915 " 1920 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro 

1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg Val Thr Ala He Leu Ser Ser Leu Thr 

1940 1945 1950 

Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp He Ser Ser Glu Cys 

1955 I960 1965 

Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp He Trp Asp Trp He 

1970 1975 1980 

Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu Lys Ala Lys Leu Met 
1985 1990 1995 ' 2000 

Pro Gin Leu Pro Gly He Pro Phe Val Ser Cys Gin Arg Gly Tyr Arg 

2005 2010 2015 

Gly Val Trp Arg Gly Asp Gly He Met His Thr Arg Cys His Cys Gly 

2020 2025 2030 

Ala Glu He Thr Gly His Val Lys Asn Gly Thr Met Arg He Val Gly 

2035 2040 2045 

Pro Arg Thr Cys Arg Asn Met Trp Ser Gly Thr Phe Pro He Asn Ala 

2050 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala Pro Asn Tyr Lys Phe 
2065 2070 2 075 2080 

Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val Glu He Arg Arg Val 

2085 2090 2095 

Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr Asp Asn Leu Lys Cys 

2100 2105 2110 

Pro Cys Gin He Pro Ser Pro Glu Phe Phe Thr Glu Leu Asp Gly Val 

2115 2120 2125 

Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro Leu Leu Arg Glu Glu 

2130 2135 ~ " 2140 

Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val Gly Ser Gin Leu 
2145 2150 2155 2160 
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Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr 

2165 2170 2175 

Asp Pro Ser His He Thr Ala Glu Ala Ala Gly Arg Arg Leu Ala Arg 

2180 2185 2190 

Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 

2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His Asp Ser Pro Asp Ala 

2210 2215 2220 

Glu Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 

He Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe 

2245 2250 2255 

Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala 

2260 2265 2270 

Glu He Leu Arg Lys Ser Arg Arg Phe Ala Arg Ala Leu Pro Val Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Thr Trp Lys Lys Pro 

2290 2295 2300 

Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Pro Arg 
2305 2310 2315 2320 

Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr Val Val Leu Thr 

2325 2330 2335 

Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu Ala Thr Lys Ser Phe 

2340 2345 2350 

Gly Ser Ser Ser Thr Ser Gly He Thr Gly Asp Asn Thr Thr Thr Ser 

2355 2360 2365 

Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp Ser Asp Val Glu Ser 

2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 2395 2400 

Ser?Asp Gly Ser Trp Ser Thr Val Ser Ser Gly Ala Asp Thr Glu Asp 

2405 2410 2415 

Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Val Thr 

2420 2425 2430 

Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro He Asn Ala Leu Ser Asn 

2435 2440 2445 

Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr Thr Ser Arg Ser 

2450 2455 2460 

Ala Cys Gin Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu 
2465 2470 2475 2480 

Asp Ser His Tyr Gin Asp Val Leu Lys Glu Val Lys Ala Ala Ala Ser 

2485 2490 2495 

Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala Cys Ser Leu Thr 

2500 2505 2510 

Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val 

2515 2520 2525 

Arg Cys His Ala Arg Lys Ala Val Ala His He Asn Ser Val Trp Lys 

2530 2535 2540 

Asp Leu Leu Glu Asp Ser Val Thr Pro lie Asp Thr Thr He Met Ala 
2545 2550 2555 2560 

Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro 

2565 2570 " 2575 

Ala Arg Leu He Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys 

2580 2585 2590 

Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro Leu Ala Val Met Gly 

2595 2600 2605 

Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu 

2610 2615 2620 

Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe Ser Tyr Asp 
2625 2630 2635 ~ 2640 

Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp He Arg Thr Glu 

2645 2650 2655 

Glu Ala He Tyr Gin Cys Cys Asp Leu Asp Pro Gin Ala Arg Val Ala 

2660 * 2665 2670 

He Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Leu Thr Asn 
2675 2680 2685 
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Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val 

2690 2695 2700 

Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr lie Lys Ala Arg 
2705 2710 2715 2720 

Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr Met Leu Val Cys 

2725 2730 2735 

Gly Asp Asp Leu Val Val lie Cys Glu Ser Ala Gly Val Gin Glu Asp 

2740 2745 2750 

Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala 

2755 2760 2765 

Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr 

2770 2775 2780 

Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala Gly Lys Arg 
2785 2790 2795 2800 

Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala 

2805 2810 2815 

Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie 

2820 2825 2830 

lie Met Phe Ala Pro Thr Leu Trp Ala Arg Met lie Leu Met Thr His 

2835 2840 ~ 2845 

Phe Phe Ser Val Leu lie Ala Arg Asp Gin Leu Glu Gin Ala Leu Asn 

2850 2855 2860 

Cys Glu lie Tyr Gly Ala Cys Tyr Ser lie Glu Pro Leu Asp Leu Pro 
2865 2870 2875 2880 

Pro lie lie Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser 

2885 2890 2895 

Tyr Ser Pro Gly Glu lie Asn Arg Val Ala Ala Cys Leu Arg Lys Leu 

2900 2905 2910 

Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala Arg Ser Val Arg 

2915 2920 2925 

Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala He Cys Gly Lys Tyr 

2930 2935 2940 

Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Ala 
2945 2950 2955 2960 

Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe Thr Ala Gly Tyr Ser 

2965 2970 2975 

Gly Gly Asp He Tyr His Ser Val Ser His Ala Arg Pro Arg Trp Phe 

2980 2985 2990 

Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val Gly He Tyr Leu Leu 

2995 3000 3005 

Pro Asn Arg 
3010 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 15 

gaaggaggga ggtttgaagg a 21 



<210> 16 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 
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<400> 16 

ccagttccgg gcaagaact 19 
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